Abstract

We find the generating function for C(n, k, r), the number of compositions of n into 
k positive parts all of whose runs (contiguous blocks of constant parts) have lengths less 
than r, using recent generalizations of the method of Guibas and Odlyzko for finding 
the number of words that avoid a given list of subwords. 

1 Introduction 

A composition of an integer n is a representation $n = a_1 + a_2 + \cdots + a_k$ in which the parts
are positive integers, and where the order of the parts is important. Thus 9 = 1 + 1 + 1 + 4 + 2 
is one of the compositions of n = 9, and 9 = 4 + 1 + 1 + 2 + 1 is another. 

A run in a composition is a maximal string of consecutive identical parts. The composi- 
tion 

28 = 3 + 5 + 5 + 5 + 3 + 3 + 4 

has run lengths of 1,3,2,1, for example. In this note we find (the generating function of) 
C(n, k, r), the number of compositions of n into k parts, whose runs all have lengths < r (see 
Theorem 3 below), by using recent generalizations of the Guibas-Odlyzko theory of counting
words that avoid a given list of subwords. 
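
The definition of a run can be made concrete with a few lines of Python. This is only an illustrative sketch; the helper name `run_lengths` is an assumption of this example and does not appear in the source.

```python
from itertools import groupby

def run_lengths(parts):
    """Lengths of the maximal blocks of equal consecutive parts, left to right."""
    return [len(list(block)) for _, block in groupby(parts)]

composition = [3, 5, 5, 5, 3, 3, 4]    # the composition of 28 from the text
print(sum(composition), run_lengths(composition))   # 28 [1, 3, 2, 1]
```

In this language, C(n, k, r) counts the lists of k positive integers with sum n for which every entry of `run_lengths` is less than r.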

In their 1981 paper [1], Guibas and Odlyzko gave an elegant solution to the following
counting problem. Given an alphabet A and a list $\mathcal{L}$ of words over that alphabet, the list
being reduced in the sense that no word on the list is a subword of any other, how many
words of length n do not contain any of the words in $\mathcal{L}$ as a subword? Other solutions of
this problem have been given by the cluster method of Goulden and Jackson [2], and by
Zeilberger's [10] method of counting words that avoid "mistakes."



The results of [1] have recently been extended by A.N. Myers [9] to the situation wherein
the letters of the alphabet are assigned weights, the weight of a word is the sum of the
weights of its letters, and one is to find the number of words of weight n that avoid the
members of the list $\mathcal{L}$. This allows us to solve problems involving compositions of integers
as well as problems that do not involve compositions.

Finally, Myers's results have been complemented by Heubach and Kitaev [5] to provide
the number of words of length k and weight n that avoid the members of the list $\mathcal{L}$, though
their theorems are restricted to the alphabet {1, 2, ..., n} and therefore apply almost exclusively
to integer compositions.

The above theorems present the generating function for the desired numbers of words as 
the first component of the solution vector of a system of linear, simultaneous equations, or, 
by using Cramer's rule, as a ratio of two determinants. 

The main point of this note is the following. The easy case in such word problems is
the case in which every pair of distinct words on the forbidden list $\mathcal{L}$ has correlation 0, in
a sense to be explained below, or equivalently, for every pair x, y of distinct words on that
list, no suffix of x is also a prefix of y. In that situation, the matrix of coefficients of the
system of linear equations that expresses the answer to the question has a very simple form.
It consists of a nonzero first row and first column and main diagonal, all other entries being
0's.

For a matrix of that form it is easy to write out the solution of the governing system of
linear equations simply and explicitly. We will do that below and then find the generating
function for C(n, k, r), the number of compositions of n into k parts all of whose runs have
lengths less than r.

2 The main theorem 

Let X and Y be two words over a given alphabet. We define the correlation $c_{XY}$ of X on
Y as follows.

• Write the word X above the word Y, aligned so that the rightmost letter of X is above 
the rightmost letter of Y. 

• Fix some integer $j \ge 0$. Shift Y j places to the left, so the rightmost letter of Y is now
under the (j + 1)st letter of X, counting from the right.

• Examine the subword of X that now overlaps with Y. This is the maximal prefix of 
X that has letters of the shifted Y below it. 
• If that subword of X is identical with the subword of Y that lies below it, take $c_j = 1$,
else take $c_j = 0$.

• Having done this for all j, the correlation of X on Y is the binary vector $c_0c_1c_2 \ldots$.

For example, if X = 110 and Y = 1011 then $c_{XY} = 011$ and $c_{YX} = 0010$, in which we have
written the bits of the c's in the order $c_0c_1 \ldots c_{m-1}$.
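
The shifting procedure above can be sketched in Python as follows. The function name `correlation` and the column bookkeeping are assumptions of this sketch, chosen so that it reproduces the two values in the example; it is not code from the source.

```python
def correlation(X, Y):
    """Correlation of X on Y as a bit string c_0 c_1 ... c_{m-1}, where m = len(X).

    X sits in columns 0..m-1. For shift j, Y is placed so that its rightmost
    letter is in column m-1-j, and the overlapping letters are compared.
    """
    m, p = len(X), len(Y)
    bits = []
    for j in range(m):
        offset = m - j - p                      # column of Y's first letter
        columns = range(max(0, offset), m - j)  # columns where X and Y overlap
        match = all(X[c] == Y[c - offset] for c in columns)
        bits.append("1" if match else "0")
    return "".join(bits)

print(correlation("110", "1011"))   # 011
print(correlation("1011", "110"))   # 0010
```

Words are given here as strings, but any sequence of comparable letters (for instance a tuple of integer parts) works the same way.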

Let each letter u of the alphabet be assigned a weight $w(u)$, and let the weight of a word
be the sum of the weights of its letters. Finally, if X is an m-letter word $X = a_0a_1 \ldots a_{m-1}$,
define the correlation polynomial $c_{XY}(x, q)$ of X on Y to be

\[ c_{XY}(x,q) = c_0 + c_1 x^{w(a_{m-1})} q + c_2 x^{w(a_{m-2}a_{m-1})} q^2 + \cdots + c_{m-1} x^{w(a_1a_2\cdots a_{m-1})} q^{m-1}. \tag{1} \]
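
As a small worked instance of (1), not taken from the source, take X = Y to be the word 121, read as the composition 1 + 2 + 1. The example assumes that each letter is weighted by its value, $w(u) = u$, which is the weighting used for compositions, and that the term with $c_j$ carries the weight of the last j letters of X.

```latex
% X = Y = a_0 a_1 a_2 = 1 2 1, with w(u) = u (assumed weighting for compositions)
\begin{align*}
c_0 &= 1 && \text{(the whole word matches itself)}\\
c_1 &= 0 && \text{(prefix 12 of $X$ against suffix 21 of $Y$)}\\
c_2 &= 1 && \text{(prefix 1 of $X$ against suffix 1 of $Y$)}\\
c_{XX}(x,q) &= 1 + x^{w(a_1 a_2)}\, q^{2} = 1 + x^{3} q^{2}.
\end{align*}
```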

The main result of [5], which extends the main result of [9], which in turn extends the main
result of Guibas and Odlyzko, is the following.

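Theorem 1 (Heubach, Kitaev) Let $\mathcal{L} = \{S_1, \ldots, S_k\}$ be a list of integer compositions,
such that no composition on the list is contained in any other. Let $F(x,q) = \sum_{\alpha} x^{w(\alpha)} q^{\ell(\alpha)}$,
the sum being extended over all compositions of all integers that avoid every word on the list
$\mathcal{L}$, where $\ell(\alpha)$ is the length of the word (number of parts of) $\alpha$ and $w(\alpha)$ is the sum of the
parts of $\alpha$. Then $F(x,q)$ is the component $X_1$ of the solution vector of the following system
of linear equations:

\[
\begin{pmatrix}
1 - x(1+q) & 1-x & \cdots & 1-x \\
x^{w(S_1)} q^{\ell(S_1)} & -c_{11}(x,q) & \cdots & -c_{1k}(x,q) \\
\vdots & \vdots & \ddots & \vdots \\
x^{w(S_k)} q^{\ell(S_k)} & -c_{k1}(x,q) & \cdots & -c_{kk}(x,q)
\end{pmatrix}
\begin{pmatrix} X_1 \\ X_2 \\ \vdots \\ X_{k+1} \end{pmatrix}
=
\begin{pmatrix} 1-x \\ 0 \\ \vdots \\ 0 \end{pmatrix}
\tag{2}
\]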

3 The easy case 

We now specialize to the case where $c_{ij}(x,q) = 0$ for all $i \neq j$, $1 \le i,j \le k$, k being the
length of the forbidden word list $\mathcal{L}$. The coefficient matrix entries in the equations (2) then all
vanish except for those in the first row, the first column, and the main diagonal. For any such
matrix, B, say, the first entry of the solution vector of the equations $B\mathbf{x} = (1 - x, 0, \ldots, 0)^T$
is easily verified to be

\[ x_1 = \frac{1-x}{\,b_{11} - \sum_{j=2}^{k+1} \dfrac{b_{1j}\, b_{j1}}{b_{jj}}\,}. \]
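
One way to carry out that verification is back-substitution. The sketch below writes $b_{ij}$ for the entries of B (a naming assumption of this check) and assumes that every diagonal entry $b_{jj}$ with $j \ge 2$ is nonzero.

```latex
\begin{align*}
b_{j1}x_1 + b_{jj}x_j &= 0 \quad (2 \le j \le k+1)
  &&\Longrightarrow\quad x_j = -\frac{b_{j1}}{b_{jj}}\,x_1,\\
b_{11}x_1 + \sum_{j=2}^{k+1} b_{1j}x_j &= 1 - x
  &&\Longrightarrow\quad x_1\Bigl(b_{11} - \sum_{j=2}^{k+1}\frac{b_{1j}\,b_{j1}}{b_{jj}}\Bigr) = 1 - x.
\end{align*}
```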



If we apply this result to the equations (2) we obtain

Theorem 2 Let $\mathcal{L} = \{S_1, \ldots, S_k\}$ be a list of integer compositions, such that no word on the
list is contained in any other. Suppose further that all correlation polynomials $c_{ij}(x, q) = 0$,
for $i \neq j$. Let $F(x,q) = \sum_{\alpha} x^{w(\alpha)} q^{\ell(\alpha)}$, the sum being extended over all compositions $\alpha$ that
avoid every word on the list $\mathcal{L}$, where $\ell(\alpha)$ is the length of the word $\alpha$. Then we have the
explicit formula

\[ F(x,q) = \frac{1}{\,1 - \dfrac{qx}{1-x} + \sum_{i=1}^{k} \dfrac{x^{w(S_i)} q^{\ell(S_i)}}{c_{ii}(x,q)}\,}. \tag{3} \]
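
The substitution behind (3) can be spelled out as follows. This is a sketch that reads the matrix entries off the system (2) in the easy case; the symbol $b_{ij}$ for those entries is a naming assumption of this check.

```latex
\begin{align*}
b_{11} &= 1 - x(1+q), \qquad b_{1,i+1} = 1 - x, \qquad
b_{i+1,1} = x^{w(S_i)} q^{\ell(S_i)}, \qquad b_{i+1,i+1} = -c_{ii}(x,q),\\[4pt]
F(x,q) = X_1
 &= \frac{1-x}{\,1 - x - qx + (1-x)\sum_{i=1}^{k} \dfrac{x^{w(S_i)} q^{\ell(S_i)}}{c_{ii}(x,q)}\,}
  = \frac{1}{\,1 - \dfrac{qx}{1-x} + \sum_{i=1}^{k} \dfrac{x^{w(S_i)} q^{\ell(S_i)}}{c_{ii}(x,q)}\,}.
\end{align*}
```

The last step divides numerator and denominator by $1 - x$.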



4 Carlitz compositions and beyond 
4.1 Carlitz compositions 

We apply the results of the previous section to finding the distribution function of the lengths
of the longest runs of integer compositions. Again, a run in a composition is a maximal string 
of identical parts. The composition 28=3+5+5+5+3+3+4 has run lengths of 1,3,2,1, for 
example. 

A Carlitz composition is one all of whose runs have length 1. That is, a Carlitz com- 
position is one in which no two consecutive parts are equal. These compositions have been 
extensively studied in recent years, both exactly and asymptotically [HlZllH]. The machinery 
of "the easy case" above counts Carlitz compositions of n, as follows. 

The list $\mathcal{L}$ of forbidden subwords is $\mathcal{L} = \{11, 22, 33, 44, \ldots\}$. A Carlitz composition is
evidently one that avoids this list, and also evidently, this list belongs to the easy case, i.e.,
the off-diagonal correlation polynomials all vanish. Thus we can use Theorem 2.

The word $S_j$ is $jj$, and its weight is $w(S_j) = 2j$. The correlation polynomials $c_{S_iS_j}$ vanish
for all $i \neq j$, while for $i = j$ we have by (1),

\[ c_{S_jS_j}(x,q) = 1 + x^j q. \]

If C(n, k) is the number of Carlitz compositions of n into k parts, we now have from equation (3),

\[
\begin{aligned}
\sum_{n,k} C(n,k)\, x^n q^k &= \frac{1}{\,1 - \dfrac{qx}{1-x} + \sum_{j\ge 1} \dfrac{q^2 x^{2j}}{1+qx^j}\,} \\
&= 1 + qx + qx^2 + (q + 2q^2)x^3 + (q + 2q^2 + q^3)x^4 + (q + 4q^2 + 2q^3)x^5 + \cdots
\end{aligned}
\]

This generating function has previously been found, in somewhat different form, by Knopfmacher
and Prodinger [7].
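
The first few coefficients can be cross-checked by counting Carlitz compositions directly, without any generating function. The recursion below is a minimal sketch for that purpose; the function name `carlitz` and its `last` argument are assumptions of this example.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def carlitz(n, k, last=0):
    """Carlitz compositions of n into k parts whose first part differs from `last`.

    `last` is the part that immediately precedes these k parts; 0 means no
    restriction, since all parts are positive.
    """
    if k == 0:
        return 1 if n == 0 else 0
    return sum(carlitz(n - p, k - 1, p) for p in range(1, n + 1) if p != last)

# Coefficient of x^5 in the expansion: q + 4q^2 + 2q^3
print([carlitz(5, k) for k in range(1, 6)])   # [1, 4, 2, 0, 0]
```

The three nonzero counts correspond to 5; to 1+4, 4+1, 2+3, 3+2; and to 1+3+1, 2+1+2.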
4.2 Beyond

Now we find the distribution function of the maximum run length in compositions of n that 
have k parts. 

Let C(n, k, r) denote the number of compositions of n into k parts that have no run of
length $\ge r$. Note that C(n, k, 2) counts Carlitz compositions of n with k parts. To find
C(n, k, r) we use the list $\mathcal{L} = \{1^r, 2^r, 3^r, \ldots\}$ of forbidden words, where, e.g., $1^r$ is a string
of r 1's. Then again the list $\mathcal{L}$ qualifies for "the easy case," since the correlations all vanish
off of the diagonal, while on the diagonal,

\[ c_{S_jS_j}(x,q) = 1 + x^j q + x^{2j} q^2 + \cdots + x^{(r-1)j} q^{r-1} = \frac{1 - q^r x^{rj}}{1 - q x^j}. \]

We now have from equation (3),

Theorem 3 The number C(n, k, r) of compositions of n into k parts that have no run of
length $\ge r$ has the generating function

\[ \sum_{n,k} C(n,k,r)\, x^n q^k = \frac{1}{\,1 - \dfrac{qx}{1-x} + \sum_{j\ge 1} \dfrac{q^r x^{rj}(1 - qx^j)}{1 - q^r x^{rj}}\,}. \tag{4} \]

When r = 3 we have

\[ \sum_{n,k} C(n,k,3)\, x^n q^k = 1 + qx + (q + q^2)x^2 + (q + 2q^2)x^3 + (q + 3q^2 + 3q^3)x^4 + \cdots, \]

and for r = 4,

\[ \sum_{n,k} C(n,k,4)\, x^n q^k = 1 + qx + (q + q^2)x^2 + (q + 2q^2 + q^3)x^3 + (q + 3q^2 + 3q^3)x^4 + \cdots. \]
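
The expansions above can be checked numerically. The sketch below expands the generating function of Theorem 3 as a truncated power series in x with polynomial coefficients in q, and compares it with a brute-force count over all compositions. It assumes the denominator is read as $1 - qx/(1-x) + \sum_{j \ge 1} q^r x^{rj}(1 - qx^j)/(1 - q^r x^{rj})$; all function names are choices of this example.

```python
from collections import defaultdict
from itertools import groupby

def denominator(r, N):
    """Denominator of (4) through x^N; D[n] maps a power of q to its coefficient."""
    D = [defaultdict(int) for _ in range(N + 1)]
    D[0][0] = 1
    for j in range(1, N + 1):
        D[j][1] -= 1                                # -q x^j, from -qx/(1-x)
        i = 1
        while r * i * j <= N:                       # geometric expansion in q^r x^{rj}
            D[r * i * j][r * i] += 1                # + q^{ri} x^{rij}
            if (r * i + 1) * j <= N:
                D[(r * i + 1) * j][r * i + 1] -= 1  # - q^{ri+1} x^{(ri+1)j}
            i += 1
    return D

def series(r, N):
    """Coefficients of 1/denominator through x^N, from the identity F * D = 1."""
    D = denominator(r, N)
    F = [defaultdict(int) for _ in range(N + 1)]
    F[0][0] = 1
    for n in range(1, N + 1):
        for m in range(1, n + 1):
            for a, d in D[m].items():
                for b, f in F[n - m].items():
                    F[n][a + b] -= d * f
    return [{k: c for k, c in poly.items() if c} for poly in F]

def compositions(n):
    """All compositions of n as tuples of positive parts."""
    if n == 0:
        yield ()
    for first in range(1, n + 1):
        for rest in compositions(n - first):
            yield (first,) + rest

def brute_force(r, N):
    """For each n <= N, count compositions by number of parts, keeping runs shorter than r."""
    table = [defaultdict(int) for _ in range(N + 1)]
    for n in range(N + 1):
        for comp in compositions(n):
            if all(len(list(block)) < r for _, block in groupby(comp)):
                table[n][len(comp)] += 1
    return [dict(t) for t in table]

for r in (2, 3, 4):
    assert series(r, 8) == brute_force(r, 8)

print(sorted(series(3, 4)[4].items()))   # [(1, 1), (2, 3), (3, 3)]
```

The printed pairs are (k, C(4, k, 3)), matching the coefficient $q + 3q^2 + 3q^3$ of $x^4$ for r = 3. The brute-force enumeration grows like $2^{n-1}$, so it is only meant for small n.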

The average length of the longest run in a composition of n has been found to be $\sim \log_2 n$,
by Grabner et al. [3], using the method of i.i.d. geometric random variables.